Information Gain Sampling for Active Learning in Medical Image Classification

نویسندگان

چکیده

Large, annotated datasets are not widely available in medical image analysis due to the prohibitive time, costs, and challenges associated with labelling large datasets. Unlabelled easier obtain, many contexts, it would be feasible for an expert provide labels a small subset of images. This work presents information-theoretic active learning framework that guides optimal selection images from unlabelled pool labeled based on maximizing expected information gain (EIG) evaluation dataset. Experiments performed two different classification datasets: multi-class diabetic retinopathy disease scale skin lesion classification. Results indicate by adapting EIG account class-imbalances, our proposed Adapted Expected Information Gain (AEIG) outperforms several popular baselines including diversity CoreSet uncertainty maximum entropy sampling. Specifically, AEIG achieves $${\sim }95\%$$ overall performance only 19% training data, while other approaches require around 25%. We show that, careful design choices, model can integrated into existing deep classifiers.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Medical Image Classification Using Information Gain for Global Feature Reduction

Medical image classification and retrieval systems have been finding extensive use in the areas of image classification according to imaging modalities, body part and diseases. One of the major challenges in the medical classification is the large size images leading to a large number of extracted features which is a burden for the classification algorithm and the resources. In this paper, it i...

متن کامل

Active Learning for Classification of Medical Signals

Hospitals are increasingly capturing and storing medical signals produced by patients. Machine learning classifiers could be used on these signals to improve patient care, but while unlabeled data is often plentiful, obtaining labels for such a large quantity of data is prohibitively expensive. Active learning attempts to achieve good classification performance with only a minimal number of lab...

متن کامل

Active Learning with Ensembles for Image Classification

In many real-world tasks of image classification, limited amounts of labeled data are available to train automatic classifiers. Consequently, extensive human expert involvement is required for verification. A novel solution is presented that makes use of active learning combined with an ensemble of classifiers for each class. The result is a significant reduction in required expert involvement ...

متن کامل

Active Learning with Ensembles for Image Classification

In many real-world tasks of image classification, limited amounts of labeled data are available to train automatic classifiers. Consequently, extensive human expert involvement is required for verification. A novel solution is presented that makes use of active learning combined with an ensemble of classifiers for each class. The result is a significant reduction in required expert involvement ...

متن کامل

Active Learning for Hyperspectral Image Classification

Obtaining labeled data for supervised classification of remotely sensed imagery is expensive and time consuming. Further, manual selection of the training set is often subjective and tends to induce redundancy into the supervised classifier, thus considerably slowing the training phase. Active learning (AL) integrates data acquisition with the classifier design by ranking the unlabeled data to ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2022

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-16749-2_13